Text-Dependent Multilingual Speaker Identification using Learning Vector Quantization and PSO-GA Hybrid Model

نویسندگان

PRIYATOSH MISHRA

PANKAJ KUMAR MISHRA

چکیده

---------------------------------------------------------------------***--------------------------------------------------------------------Abstract In this work a multilingual speaker identification system is proposed. The feature extraction techniques employed in system extract Mel frequency cepstral coefficient (MFCC), delta mel frequency cepstral coefficient (DMFCC) and format frequency. The feature selection is done using hybrid model of particle swarm optimizatiom (PSO) and Genetic Algorithm (GA). We have used Learning Vector Quantization (LVQ) artificial Neural Network classifiers. The speech database consists of 40 speakers (20 males+ 20 females) speech utterance. The speech utterance is recorded for a specific sentence in three different languages viz. “Now this time you go” (in English), “Adhuna Asmin Twam Gachh “(in Sanskrit), “Ab Iss Baar Tum Jao” (in Hindi). Total word for this purpose is 14 including 4 for Sanskrit and 5 Hindi and English. The average identification rate 79.99% is achieved when the Network is trained by LVQ and it shows 80.52% when LVQ is trained using hybrid PSO-GA model.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text-Dependent Multilingual Speaker Identification using Back Propagation Neural Network and PSO-GA Hybrid Model

In this work a multilingual speaker identification system is proposed. The feature extraction techniques employed in the system extract Mel frequency cepstral coefficient (MFCC), delta mel frequency cepstral coefficient (DMFCC) and format frequency. The feature selection is done using hybrid model of particle swarm optimizatiom (PSO) and Genetic Algorithm (GA). We have used Back Propagation (BP...

متن کامل

Speaker Recognition Using Gaussian Mixtures Models

Speech signal contains several levels of information. At first it contains information about the spoken message. At second level speech signal also gives information about the speaker identity, his emotional state and so on. The task of speaker recognition can be divided into two parts: speaker identification and speaker verification. Speaker identification is answering the question which one o...

متن کامل

Speaker Identification From Youtube Obtained Data

An efficient, and intuitive algorithm is presented for the identification of speakers from a long dataset (like YouTube long discussion, Cocktail party recorded audio or video).The goal of automatic speaker identification is to identify the number of different speakers and prepare a model for that speaker by extraction, characterization and speaker-specific information contained in the speech s...

متن کامل

Modified Technique for Speaker Recognition using ANN

Speaker recognition consists of three phases: pre-processing, feature extraction and classification. During the first phase, the computer records the voice pattern of the speaker and analyse it. By the end of the second phase, the main features of the voice pattern are extracted. In the third phase, many classification techniques are exist such as artificial neural network (ANN) , hidden Markov...

متن کامل

Speaker identification using discriminative features selection

A new method of text-dependent speaker identification using discriminative feature selection is proposed in this paper. The characteristics of the proposed method are as follows: feature parameters extraction, vector quantization with the growing neural gas (GNG) algorithm, model building using gaussian distributions and discriminative feature selection (DFS) according to the uniqueness of pers...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2016

Text-Dependent Multilingual Speaker Identification using Learning Vector Quantization and PSO-GA Hybrid Model

نویسندگان

چکیده

منابع مشابه

Text-Dependent Multilingual Speaker Identification using Back Propagation Neural Network and PSO-GA Hybrid Model

Speaker Recognition Using Gaussian Mixtures Models

Speaker Identification From Youtube Obtained Data

Modified Technique for Speaker Recognition using ANN

Speaker identification using discriminative features selection

عنوان ژورنال:

اشتراک گذاری